Online Social Network Profile Linkage
نویسندگان
چکیده
Piecing together social signals from people in different online social networks is key for downstream analytics. However, users may have different usernames in different social networks, making the linkage task difficult. To enable this, we explore a probabilistic approach that uses a domain-specific prior knowledge to address this problem of online social network user profile linkage. At scale, linkage approaches that are based on a näıve pairwise comparisons that have quadratic complexity become prohibitively expensive. Our proposed threshold-based canopying framework – named OPL – reduces this pairwise comparisons, and guarantees a upper bound theoretic linear complexity with respect to the dataset size. We evaluate our approaches on real-world, large-scale datasets obtained from Twitter and Linkedin. Our probabilistic classifier integrating prior knowledge into Näıve Bayes performs at over 85% F1-measure for pairwise linkage, comparable to state-of-the-art approaches.
منابع مشابه
Online Social Network Profile Linkage Based on Cost-Sensitive Feature Acquisition
Billions of people spend their virtual life time on hundreds of social networking sites for different social needs. Each social footprint of a person in a particular social networking site reflects some special aspects of himself. To adequately investigate a user’s preference for applications such as recommendation and executive search, we need to connect up all these aspects to generate a comp...
متن کاملA Hybrid Model for Linking Multiple Social Identities Across Heterogeneous Online Social Networks
Automated online profiling consists of the accurate identification and linking of multiple online identities across heterogeneous online social networks that correspond to the same entity in the physical world. The paper proposes a hybrid profile correlation model which relies on a diversity of techniques from different application domains, such as record linkage and data integration, image and...
متن کاملDiscovery and Protection of Sensitive Linkage Information for Online Social Networks Services
This paper investigates the problem of suppressing access to sensitive linkage information over data published by users of an online social network service. We unveil the potential threats by inferring linkage information from the user-published data, and suggest a class of data publishing schemes to enable distributed data publication by individual users but hide the sensitive information. Our...
متن کاملSocial Network Data Analytics Social Network Data Analytics
The advent of online social networks has been one of the most exciting events in this decade. Many popular online social networks such as Twitter, LinkedIn, and Facebook have become increasingly popular. In addition, a number of multimedia networks such as Flickr have also seen an increasing level of popularity in recent years. Many such social networks are extremely rich in content, and they t...
متن کاملEuropean Journal of Open, Distance and E-Learning
The most productive learning experience for students whether online or in face-to-face classes can often be the interaction among students and with an instructor. Online teaching and Social Network Analysis (SNA) offer the opportunity to examine intellectual social networking and strategies that promotes student interaction which can enhance learning. This study focuses on two online courses in...
متن کامل